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Abstract 



This article is a short introduction to generic case complexity, which 
is a recently developed way of measuring the difficulty of a computa- 
tional problem while ignoring atypical behavior on a small set of in- 
puts. Generic case complexity applies to both recursively solvable and 
recursively unsolvablc problems. 
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1 Introduction 

Generic case complexity was introduced a few years ago |15] as a way of 
estimating the difficulty of recursively unsolvable problems in combinato- 
rial group theory More recently generic complexity and related ideas have 
proved useful in cryptanalysis of public key systems [20]. 

Combinatorial group theory has its own computational tradition extend- 
ing back for more than a century. Almost all computational problems in 
combinatorial group theory are recursively unsolvable. During the 1990's 
people working under the leadership of Gilbert Baumslag on the Magnus 
Computational Group Theory Package [19] noticed that for some difficult 
problems, simple strategies worked well in practice. Computer scientists 
had come to similar conclusions earlier in regard to NP-complete problems. 
The group theoretic version of this theme is developed in [15]. The authors 
define a generic set of inputs and show that for a large class of finitely gener- 
ated groups the word, conjugacy and membership problems can be solved in 
linear time on a generic set even though these problems might be recursively 
unsolvable for the group in question. 

The main point here is that it can be convenient and practical to work 
with a complexity measure which focuses on generic sets of inputs to al- 
gorithms and ignores sparse sets of atypical inputs. Generic complexity is 
close in spirit to the analyses by Smale [21] and Vershik and Sporyshev [23] 
of the simplex algorithm, and to errorless heuristic case complexity [2]. It 
is also close to average case complexity. Generic complexity is simpler and 
broader in scope than average case complexity. In addition it is a more 
direct measure of the performance of an algorithm on most inputs. These 
points are discussed in Section [631 

This article is an informal introduction to generic case complexity. It 
is not a complete survey of current developments. We restrict ourselves 
to polynomial time complexity, and we omit probabilistic algorithms. We 
also do not discuss generic case completeness or the emerging theory of 
problems undecidable on generic sets. These topics will be included in a 
more comprehensive treatment to appear later. 
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2 Computational problems 



2.1 Decision problems and search problems 

The usual approach to the study of computational problems is to begin with 
Turing machines and membership problems for formal languages. Member- 
ship problems are particular cases of decision problems, that is, computa- 
tional problems whose answer is "yes" or "no" . Every decision problem can 
be turned into a membership problem, but this transformation may add 
complications and even change the nature of the problem. For example rep- 
resenting graphs by words over an alphabet is possible but inconvenient and 
sometimes misleading PQ. Thus we are led to a more general view. 

Definition 2.1. A decision problem is a pair V = (L,I), where / is a 
countable set of inputs for V and L C / is the positive part of V. That is, 
the answer for input w £ / is "yes" if w £ L and "no" otherwise. 

Definition 2.2. A search problem is a pair V = (R,I x J), where I and J 
are countable sets, and R C I x J is a binary predicate. Given an element 
w E I, one is required to find v € J such that (w, v) G R; that is, such that 
R(w, v) is true. 

For example take J to be the integers, / be the set of polynomials with 
coefficients in J, and R the set of pairs (w, v) such that v is a root of w. 
Given a polynomial, w, one is required to find an integer root, v, of w. 

In the preceding example the desired v may not exist. In other search 
problems one may know in advance that v : s always exist and that the only 
task is to find one. 

When we speak of a problem V, we mean either a decision problem or a 
search problem. 

2.2 Size functions and stratifications 

Decision and search problems are solved by algorithms and partial algo- 
rithms. In general to study the complexity of an algorithm A, one compares 
the resources spent by A on input w to the size of w. In our case the re- 
source is time, or more precisely, the number of steps required for A to deal 
with w. We are being informal here. The usual formal definition is stated 
in terms of the number of steps required by a deterministic Turing machine 
which implements A. 

Definition 2.3. The time consumed by an algorithm A on an input w is 
Tj±(w), the number of steps performed by an algorithm A on the input w. 
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If A is a partial algorithm, then T4 is a partial function. Its domain is set 
on inputs on which A halts. 

Definition 2.4. A size function for a set I is a map a : / — > N, the nonneg- 
ative integers, such that the preimage of each integer is finite. 

Definition 2.5. A stratification for a set I is an ascending sequence of finite 
subsets whose union is /. 

For each size function a determines a stratification with subsets a~ 1 ({0}), 
<7 _1 ({0, 1}), . . ., and every stratification can be obtained from a size func- 
tion. The finiteness condition in Definitions 12.41 and 12.51 will be relaxed later 
when we discuss computational problems with a probability distribution on 
the set of inputs. 

The choice of the size function depends of course on the problem at hand. 
If the input w is a natural number, its size may be taken to be the number 
of symbols in its representation of to a certain base. For any two integer 
bases greater than 1 the corresponding sizes are about the same; they differ 
by at most a fixed multiplicative factor. However if w is written down in 
unary notation, that is, as a sequence of w l's, its size will be exponentially 
greater than its size with respect to the bases greater than 1. 

Consider another example. An input for the satisfiability problem, SAT, 
is a boolean expression in conjunctive normal form. There is a standard 
way to write such an expression as a word over a finite alphabet. A natural 
choice for input size is the length of that word. But if we are taking the 
trouble to find difficult instances of SAT, it might be reasonable to take 
the time needed to generate the word as its size instead. It is also worth 
noting that converting an arbitrary formula to CNF may increase its length 
exponentially. 

It is easy to obtain surprising complexity bounds by choosing artificial 
size functions. We do not pursue further here the question of defining pre- 
cisely what is a reasonable size function; nevertheless we trust the reader 
will agree that the size functions which appear below are reasonable. 

2.3 Worst case complexity 

Definition 2.6. Let A be an algorithm, T4 its time function, / the set 
of inputs, and a a size function for /. The worst case complexity of A 
with respect to a is the function WCj± : iV — > N defined by WCj[(n) = 
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We are usually not interested in the precise worst case complexity but 
rather in estimating its rate of growth. We say that a problem V has poly- 
nomial worst case complexity if it is solved by an algorithm A for which 
WCj^{n) is 0(n k ) for some k. To define exponential worst case complexity, 
we replace the condition 0(n k ) for some k by 0(2 n£ ) for some e > 0. We 
write V € P and V € E respectively. 

Worst case complexity was the first and is still the most commonly used 
complexity measure. When an algorithm has low worst case complexity, say 
Cj^{n) is 0(n 2 ), we can be pretty sure that it is practical. But the converse 
is not true because the worst cases, which determine WCa, may be rare. 
This phenomenon has been well known since the 1970's. 

The simplex algorithm for linear programming is frequently used as an 
example of an algorithm for which hard inputs are rare. The algorithm 
is used hundreds of times daily and almost always works quickly. But it 
has been shown by V. Klee and G. Minty [14] that there are hard inputs. 
More precisely, the simplex algorithm is in E — P. Khachiyan devised an 
ingenious polynomial time algorithm for linear programming problems [13j . 
but the simplex algorithm continues to be widely used because the hard 
inputs never occur in practice. Vershik and Sporyshev [23] and Smale |21| 
showed independently that the simplex algorithm runs in linear time on a 
set of inputs of measure one. 

Sometimes we want a problem to be difficult. This is the case when 
we are looking for a computational problem on which to base a public key 
cryptosystem. Solving an instance of the problem is equivalent to breaking 
the cryptosystem for a particular key choice, so we want the computational 
problem to be hard almost all the time (of course it should be easy if you are 
in possession of certain secret information, otherwise it would be impossible 
to decode messages). In this situation the worst case behavior of algorithms 
is irrelevant. 

Worst case complexity is not defined for partial algorithms, because they 
do not always halt. Nevertheless it can be the case that the nonhalting in- 
stances are rare for a partial algorithm just as hard instances can be rare for 
an algorithm. Coset enumeration (probably the first mathematical proce- 
dure to be programmed on a computer) is an example of a partial algorithm 
which is useful in practice for solving instances of a recursively unsolvable 
problem, namely whether or not a given finite presentation present a finite 
group. 

In the next section we propose a new complexity measure, generic case 
complexity, which applies to partial algorithms as well as to algorithms. 
Subsequently we will discuss the extent to which generic case complexity 
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overcomes the deficiencies of worst case complexity and the relation between 
generic case and average case complexity. 

3 Generic case complexity 

Generic case complexity is an attempt to deal with the fact that worst case 
complexity can be unsatisfactory when the difficult inputs to an algorithm 
are sparse and not observable in practice. The main idea is to ignore small 
sets of difficult inputs and consider the worst case complexity on the re- 
maining large set of more tractable inputs. By large we mean generic as 
defined below. The treatment here will be generalized when we discuss 
distributional problems in Section [6] 

3.1 Asymptotic density 

Definition 3.1. Let / be a set of inputs with size function a. Define B n , the 
ball of radius n, by B n = {w \ w £ I, a{w) < n}. A subset R C / is said to 
have asymptotic density a, written p(R) = a, if lim^-joo \R n B n \/\B n \ = a 
where |X| denotes the size of a set X. If R has asymptotic density 1, it is 
called generic; and if it has asymptotic density 0, it is negligible. 

Recall from Definition 12.41 that a~ 1 (n) is always finite. Some authors 
use limsup rather than lim in Definition 13. 11 

Asymptotic density can be defined using spheres in place of balls. The 
sphere of radius n is I n = {w \ w G I, a{w) = n}, that is, the set of inputs 
of size n. We say spherical density and volume density to distinguish the 
two definitions, and we write p° for spherical density. 

Lemma 3.2. Keep the notation of Definition \3. 1\ If almost all spheres are 
nonempty and p°(K) exists, then so does p(K), and the two are equal. 

Proof. Set x n = \K n B n \ and y n = \B n \. Then y n < y n +i for almost all n, 
and limy n = oo. By Stolz's theorem 

p(K) = lim — = lim — = lim J — = p°(K). 

n->oo y n rwoo y n - y n _ x n^oo |£ n | 

□ 

3.2 Convergence rates 

A generic subset of inputs is asymptotically large. Whether it appears large 
in practice depends on how fast the limit in Definition 13.11 converges. 
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Definition 3.3. Let R be a subset of /, and suppose that the asymptotic 
density p(R) exists. The function 5r(ti) = \R PI B n \/\B n \ is called the fre- 
quency function for R. 

Definition 3.4. Suppose R C I and <5r is the density function of R. We 
say that R has asymptotic density p(R) with superpolynomial convergence 
if \p{R) — <5r(h)| is o(n~ k ) for every natural number k. For short we call a 
generic set with superpolynomial convergence strongly generic. Its comple- 
ment is strongly negligible. 

Of course one can introduce exponential convergence, superexponential 
convergence, etc. In the original papers [15} [T6] strong genericity was re- 
served for generic sets with exponential convergence, but seems that super- 
polynomial convergence is fast enough to obtain the same results. 

3.3 Generic case complexity of algorithms and problems 

In this section we come to the main notion of the paper. 

Definition 3.5. Let V be a problem. A partial algorithm, A, for V gener- 
ically solves V if the halting set, H^, of A is a generic subset of the set of 
inputs, /. In this case we say that V is generically solvable. 

In particular any algorithm for V generically solves V . We will see that 
a generically solvable problem may be recursively unsolvable. 

Definition 3.6. Let V be a problem with size function a, and let A a partial 
algorithm for V . A function / : N — > N is a generic upper bound for A if 
the set Hj^j = {w 6 / | w £ H_a and T_a(w) < f(a(w))} is generic in /. If 
Haj is strongly generic, then / is a strongly generic upper bound. 

Now we are ready to define generic complexity classes of algorithmic 
problems. 

Definition 3.7. A problem V is generically solvable in polynomial time if 
there exists a partial algorithm for V with a polynomial generic upper bound. 
If there exists a partial algorithm with a polynomial strongly generic upper 
bound, then V is strongly generically solvable in polynomial time. 

For short we refer to problems in two classes defined above as generically 
polynomial and strongly generically polynomial respectively. We denote the 
two classes by GenP and SGP. 

It is clear that one can extend Definition 13.71 to other time bounds. 
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4 Complexity of unsolvable and NP-complete prob- 
lems 

Analysis of some unsolvable and NP-complete decision problems yields in 
each case an easy partial algorithm with a generic halting set. We present a 
few of these results here. A more thorough treatment with complete proofs 
will appear later. 

4.1 The halting problem 

Theorem 4.1 ([10]). The halting problem for deterministic Turing ma- 
chines with semi-infinite tape and tape alphabet {ao,ai} is generically solv- 
able in polynomial time; that is, it is in GenP. 

We do not know whether similar results hold for Turing machines with 
bi-infinite tapes. 

The halting problem is the premier recursively unsolvable problem. For 
Turing machines with semi-infinite tape extending to the right it is required 
to decide whether or not a given Turing machine halts when started at the 
leftmost square of a tape filled with ao's. The set of inputs, /, is the set 
of Turing machines of the given type. Recall that a Turing machine M 
satisfying the conditions of Theorem 14.11 may be thought of as a map 

p : {1, 2, . . . , n} x {a , ai} -► {0, 1, 2, . . . , n} x {a , ai} x {L, R} (1) 

where 1 is the initial state of M, is the halting state, and {2, . . . , n} are the 
other states. The meaning of p(i, a r ) = (j, a s ,L) is that if M is in state i and 
scanning a square containing a r , then its next move is to overwrite a r with 
a s , move left on the tape, and go to state j. Likewise p(i,a r ) = (j,a s ,R) 
has the same effect except that M moves right instead of left. If M reaches 
state 0, there are no further moves to make, and M halts. As the tape for 
M extends infinitely far to the right but not to the left, M may attempt to 
move off the tape to the left. In this case the move is not completed, and 
M crashes. 

The map p may be thought of as a program. The states are the numbers 
of the instructions, and the instruction, except for instruction number 0, 
tells what to do depending on which letter of the tape alphabet is currently 
being scanned. Instruction halts the machine. 

We take the inputs, /, to be the set of programs p defined above, and 
the size of p is defined to be the the number of non-halting states (of which 
there must be at least one); I n is the set of programs with n non-halting 
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states. Here is a polynomial time partial algorithm which decides the halting 
problem. 

Algorithm 4.2. Input a program p 

Run p until the first time it repeats a state 

If p halts, say "Yes" 

If p crashes, say "No" 

Else loop forever. 

It would be more informative to say "Don't know" than to loop forever, 
but accommodating this possibility would slightly complicate the definitions 
in Section 13.31 We have opted for simplicity. 

Algorithm 14.21 is obviously polynomial time on its domain and clearly 
correct. Its domain, D, is the set of programs which either halt or crash 
before repeating a state. It remains only to show that D is generic. We 
sketch the argument. 

Let D' be the the set of programs which crash before repeating a state. 
Since D' C D, it is enough to show that D' is generic. We can easily count 
the number of programs in I n , the sphere of radius n; \I n \ = (4n) 2n for 
n > 1 . Thus by Lemma 13.21 we are free to use spherical density, p° instead 
of volume density, p. 

Consider the programs in the sphere I n . Half of them have L in their 
first instruction, and the other half have R. Thus half the programs in I n 
crash immediately, and the other half move right to square 1 and transfer 
from state 1 to another state. There are n — 1 non-halting states besides 
state 1. Thus the proportion of programs in I n which do not halt or repeat 
states at the first step is \ + ^^qrj-- 

Let Cfc C / be the subset of programs not repeating states or halting 
within the first k moves. The preceding discussion shows that p°(C\) = 1. 
Further analysis yields p°{Ck) = 1 for all k. 

Programs in Cf. move to a new non-halting state for each of their first k 
steps. At each of these steps half the remaining programs (those which have 
not previously crashed) move left on the tape and half move right. Thus for 
n > k the proportion of programs in Ck H I n which do not crash in their 
first k steps is the same as the fraction of random walks of length k on the 
integers which start at and reach only nonnegative numbers. By known 
results that fraction goes to as k goes to infinity. 

Fix e > 0. If A; is large enough, the fraction of random walks of length k 
which avoid negative numbers is less than e. Thus for n > k the proportion 
of programs in D I n which do not crash in their first k steps is also less 
than e. But for n large enough, \Ck H I n |/|/ n | > 1 — e because p°{Ck) = 1. 
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Hence the fraction of programs in I n which crash without repeating a state 
is at least (1 — e) 2 . Consequently p°(D') = 1 as desired. 

4.2 The Post Correspondence Problem 

The set of inputs for the Post Correspondence Problem all finite sequences 
of pairs of words (u\,vi) . . . (u n ,v n ), n > 1, over a fixed finite alphabet 
{a±, . . . , afc}, k > 2. The output is "Yes" if ■ ■ ■ Ui m = ■ ■ ■ vi m for some 
sequence of indices of length m > 1 and "No" otherwise. We define I n to be 
the collection of inputs with n pairs of words of length between 1 and n. 

It is well known that PCP is recursively unsolvable. Nevertheless there 
is a trivial partial algorithm which works well enough to show that PCP is 
strongly generically polynomial. 

Algorithm 4.3. Input an instance of the Post Correspondence Problem 
If for all i, neither m nor Vi is a prefix of the other, say "No" 
Else loop forever 

For any solution ■ ■ ■ Ui m = ■ ■ ■ vi m it is clear that one of , is 
a prefix of the other. Thus our algorithm never gives a wrong answer. 

Theorem 4.4. The Post Correspondence Problem is strongly generically 
polynomial; that is, it is in SGP. 

Proof. The size of I n is (l+k+- ■ -+k n ) 2n . If we restrict u\ to be a prefix oivk, 
then there are at most n + 1 possibilities for u\. Thus the number of inputs 
in J n in which u\ is a prefix of v\ is no more than (n + l)(l + /c + - • • + fc n ) 2n ~ 1 . 
We conclude that the number of inputs in I n for which some Ui is a prefix 
of Vi or vice- versa is at most 2n(n + 1)(1 + k + • • • + k n ) 2n ~ 1 . Dividing this 
number by \I n \ yields pj~^~j~j~fcn i which approaches exponentially fast as 
n goes to infinity. □ 

4.3 3-Satisfiability 

SAT has long been known to be easy almost all the time, and there is 
considerable experimental evidence that 3-SAT is too [4j. Thus it is no 
surprise that 3-SAT is generically easy. 

An instance of 3-SAT, i.e., an input for 3-SAT, is a finite conjunction of 
clauses 

[10' V 101 V 1] A [110 V 11' V 111] A • • ■ 
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where the variables are positive integers written in binary, and ' denotes 
negation. The problem is to decide whether or not there is a truth assign- 
ment to the variables which makes all the clauses true. 

If the eight different clauses with variables 1, 10, 11 and their negations 
all appear in the input, then the formula is not satisfiable. Thus the following 
partial algorithm is correct. 

Algorithm 4.5. Input an instance of 3-SAT 

If all the clauses with variables 1, 10, 11 occur, say "No" 

Else loop forever 

Theorem 4.6. 3-SAT is in SGP 

Start with the regular language of clauses 

R = [1(0 + 1)*(V +' V)1(0 + 1)*(V +' V)1(0 + 1)*(]+']). 

Inputs for 3-SAT are words in the free submonoid (i?A)* of E*, and size is 
word length. 

Think of inputs as words over the countable alphabet of clauses. To 
prove Theorem 14.61 it suffices to show that the set of words which omit some 
fixed clause is strongly negligible; for then the set of words omitting any 
of the eight clauses just mentioned will be strongly negligible too. Hence 
Algorithm 14.51 which searches the input for these clauses, will find them in 
linear time on a generic set of inputs. 

In fact the set of words which omit some fixed clause is asymptotically 
negligible with exponential convergence. Proof of this fact requires a using 
the Perron Frobenius Theorem to compare maximum eigenvalue for the 
incidence matrix of a finite automaton recognizing R with the maximum 
eigenvalue for the incidence matrix of a finite automaton recognizing the 
sublanguage of R which omits the eight clauses. 

5 Difficult instances 

Let V be a hard problem, say undecidable or NP-complete. The results of 
Section U] show that difficult instances of V may be rare. Sometimes we want 
to find hard instances. For example consider a cryptosystem based on an 
underlying computational problem V . The partial algorithms for V may be 
viewed as attacks on C; the hard instances are the good keys. 

How do we find hard instances? Typical existing descriptions for NP- 
complete problems use the notions of parameters and phase-transitions. See 
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for example [U Section 3] for a discussion of the location of difficult instances 
of 3-SAT. This approach is good for initial analysis, but quite often further 
study reveals that the description does not reflect the complexity of the set 
of hard instances of the problem. Our strong belief is that if the problem V 
is algorithmically hard then the set of hard instances cannot be satisfactorily 
described by parameters. 

In [18] Nancy Lynch showed that if V is a decision problem not in P, 
then one can construct a recursive subset of inputs, J C I, such that for 
any partial algorithm, A for V and any polynomial p, A succeeds in time 
p(n), where n is the size of the input, on only finitely many inputs in J. J 
is called a polynomial complexity core for V. 

Lynch's construction involves enumerating all partial algorithms, so it 
is not practical. In her paper she asks whether certain decision problems 
might admit a practical construction. As far as we know, none has been 
proposed. 

Lynch's result attracted the interest of many other researchers. See [23] 
for a recent account of subsequent work. The implications of this work for 
the theory of generic complexity are not yet known. 

6 Distributional computational problems 

In this section we generalize the definition of asymptotic density (Defini- 
tion !3.ip by allowing ensembles {/i n } of probability distributions. Each fi n is 
a probability distribution on the ball B n (or sphere I n .) Balls no longer need 
to be finite, and a subset R C I has volume density a if lim fi n (Rr\B n ) = a. 
Spherical density is defined similarly. The discussion of generic case com- 
plexity in Section [3] makes sense with the generalized definition of asymptotic 
density in place of the original definition. The original definition corresponds 
to the case of uniform distributions on finite balls and spheres. We write 
and p° for volume density and spherical density defined with respect to the 
measure /i. 

One source of ensembles {/i n } is probability distributions on I. Given a 
probability distribution, fi, on /, we define \x n for each n to be the conditional 
probability on B n or I n . We assume that \x is atomic, i.e., that is defined 
for every singleton {x}. For any subset R C I, fJ-(R) = SzG-rM^O - 

Definition 6.1. A distributional computational problem is a pair (V,n) 
where V is a computational problem and /u is a probability measure on I. 

Here is an example to illustrate how \x might arise in practice. Consider 
the following search problem from combinatorial group theory. For a fixed 
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finite presentation of a group G, the set of inputs, /, consists of all words (in 
the generators of G) defining the identity in G. It is required for each w € I 
to verify that w does define the identity by constructing a certain kind of 
proof, and a particular procedure is introduced for that purpose. The details 
are not important here, the point is that / is recursively enumerable but need 
not be recursive. How then are we to define a reasonable stratification in 
order to estimate the generic complexity of our procedure? Stratifying by 
the length of w is not useful because the resulting B n 's need not be recursive. 
The answer [22] is to define a random walk over / which stops at each point 
in / with positive probability and thus induces \x. 

6.1 Average case complexity 

Average case complexity provides a measure of the difficulty of a distribu- 
tional problem. The definition of average case complexity was motivated by 
the observation that some NP-complete problems admit algorithms which 
seem to run quickly in practice. The idea was to explain this phenomenon 
by showing that although in the maximum running time of an algorithm A 
over all inputs of size n might be very high, the average running time might 
be much smaller. For this purpose the NP-complete problem was converted 
to a distributional problem by introducing a probability distribution fi on 
the set of inputs, /. 

As average case complexity is very similar to generic case complexity, 
we will discuss the relation between the two in some detail. We begin with 
a quick review of average case complexity. We refer the reader to Levin's 
original paper [T7], which has been further developed by Gurevich [7] and 
Impagliazzo |12j . 

Definition 6.2. An algorithm A is polynomial time on /U-average if its time 
function, T4 satisfies T_^{x) < f(x) for some polynomial on ^-average func- 
tion /. The class of distributional problems decidable in time polynomial 
on average is denoted by AvP. 

It remains to define when a function is polynomial on average. A straight- 
forward definition would be the following. 

Definition 6.3. A function / : I — > M + is expected polynomial on spheres 
(with respect to an ensemble of spherical distributions {/U n }) if there exists 
k > 1 such that 
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However, in order to obtain closure under addition, multiplication, and 
multiplication by a scalar we must define a larger class. 

Definition 6.4 (|17j). A function / : I — > IR + is polynomial on /i-average 
if there exists e > such that Jj(f (x)) e a(x)~ x fj,(x) < oo. 

Which is equivalent to the following. 

Definition 6.5 ([11]). Let {// n } be an ensemble of volume distributions on 
balls {B n } of /. A function / : / — > K is polynomial on average with respect 
to {/%} if there exists an e > such that J B f e (x)/j, n (x) = 0(n). 

6.2 Average Case vs Generic Case 

Average case complexity provides a more balanced assessment of the diffi- 
culty of an algorithmic problem than worst-case complexity. Many algorith- 
mic problems, such as the NP-complete Hamiltonian Circuit Problem [9], 
are hard in the worst case but easy on average for reasonable distributions. 

Average case complexity is very similar to generic case complexity, but 
we argue that the latter has certain advantages. Generic complexity applies 
to undecidable problems as well as to decidable problems, it is easier to 
employ than average complexity, and it is a direct measure of the difficulty 
of a problem on most inputs. Average case complexity tells us something 
else. In [8] Gurevich explains, in terms of a Challenger-Solver game, that 
average case analysis describes the fraction of hard instances of the problem 
with respect to a measure of difficulty In other words to have polynomial 
on average time an algorithm should have only a sub-polynomial fraction of 
inputs that require super polynomial time to compute. 

Now we give some more precise comparisons. Our first observation is that 
AvP and GenP are incomparable; that is, GenP — AvP and AvP — GenP 
both contain functions. We leave it as an exercise for the reader to verify the 
first assertion by constructing a function which is very large on a negligible 
set and small on the complementary generic set. The second part follows 
from the next example. 

Example 6.6. Let I = {0, 1}*. For w 6 I define a(w) = \w\, the length of 
w, and define fj,(w) = 2 -2 M -1 . Consider f : I -> N defined by F(w) = 2H 
Observe that / G AvP by Definition 16.41 with e < 1, but / ^ GenP. 

However, a big chunk of AvP does lie in GenP, namely the functions 
satisfying Definition 16.31 
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Proposition 6.7. If a function f : I —* M + is polynomial on /i-average on 
spheres, then / is generically polynomial relative to the asymptotic density 
/V 

Proof. If / is an expected polynomial then there exists a constant c and 
k > 1 such that for any n 

/ f*(w)[i n (w) < cn. 

Jin 

It follows that for any polynomial q(n) 

u n {x G I n | p{x) > q(n)cn] < l/q(n), 

Now let S(f,q,k) = {x G I | /(x) > (cg(s(x))s(x)) fe } be the set of those 
instances from I on which f(x) is not bounded by (cq(s(x))s(x)) k . Then 

Hn(In n S(f,q,k)) = (J, n {x G J„ I /^(x) > q(n)cn} < l/q(n), 

therefore, the asymptotic density of S(f, q, k) exists and equal to 0. This 
shows that / is generically bounded by the polynomial (cq(n)n) k . □ 

Proposition 16.71 gives a large class of polynomial on average functions 
which are generically polynomial. 

Corollary 6.8. Let A be an algorithm for the distributional problem V. If 
the expected time of A with respect to the spherical distributions is bounded 
above by a polynomial then A G GenP. 

On the other hand under some conditions membership in GenP implies 
membership in AvP. We refer the reader to |16j . 
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